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PROCESSING "OF REQUESTS FOR STATIC OBJECTS IN A NETWORK SERVER 

BACKGROUND 

1. Field of the Present Invention 

The present invention generally relates to the field of networked computing and more 
particularly to a method and system to improve server performance by handling static objects as 
special cases requiring reduced network processing. 

2. History of Related Art 

In the field of data processing networks, it is becoming increasingly common to provide 
services via servers or server systems that are accessible over a network. For purposes of this 
disclosure, a server system refers to a cluster in which a set of microprocessor-based server 
devices are connected on a local area network (LAN) to provide a common set of services to 
chent appUcations. Cost consideration may frequently motivate the manufacturers and 
purchasers of servers to implement the servers with less than state-of-the-art technology and with 
a minimum of peripheral components. In a common implementation, for example, server 
clusters are designed as a set of diskless server devices that share disk storage via some form of 
networked storage whether it be a network attached storage (NAS) device or a storage area 
network (SAN). 

Chents access servers generally in two modes, namely data retrieval and data updates. In 
data retrieval accesses, the subject of this disclosure, it is conceivable that execution may be 
dominated by network processing in which information is made suitable for transport across the 
network from a server to a chent (or vice versa). The devices attached to a network must use a 
common protocol or set of protocols to fransfer information. Network protocols are typically 
described in terms of protocol "layers", where each layer of processing is related to a specific 
fimction. The Open Systems friterconnect (OSI) working group of the International Standards 
Organization (ISO) has distributed a widely recognized model (the OSI Reference Model) of a 
prototypical network. The OSI Reference Model describes a seven-layer network protocol, from 
the physical layer comprising the physical medium and the actual electrical signals of a network, 
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to the application layer that acts as an interface between the network and application processes 
such as email. 

Perhaps the most widely implemented network protocol is the suite of protocols 
commonly referred to as the Transmission Control Protocol/Internet Protocol (TCP/IP). TCP/IP 
provides the network backbone for a large number of data processing networks including the 
Internet. Each cHent and/or server connected to the Internet (or other TCP/IP comphant 
network) must include TCP/IP code that converts datagrams or packets transmitted over the 
network to a stream of bytes interpretable by an application process. Conversely, each network 
agent must be able to convert application information into a set of one or more packets suitable 
for transporting across the network. Packets are thus required to traverse the protocol "stack" of 
layers twiU during each transaction, once as they descend the stack (from application layer to 
physical layer) at the source and again as they ascend the stack (from physical layer to 
apphcation layer) at the destination. 

Full TCP/IP processing of each network object can be tune consuming and processor 
intensive. Generating and checking checksums to facilitate error detection, allocating and 
deallocating memory for requested objects, packet framing and deframing, data copying, general 
protocol processing, and other required tasks can place a performance Umiting load on the server 
cluster. It would therefore be desirable to reduce the amount of network processing required of 
servers that use widely implemented protocols stacks like TCP/IP. 

Data objects that are retrieved by cHents may be broadly classified into static objects and 
dynamic objects. Static objects are objects or files that are fully self-contained and change 
infrequently. For instance, a static Web page is typically available as a file that need not be 
generated at request time by the server. Also, once the Web page is generated, it is typically 
available under the same Universal Resource Identifier (UTU) and rarely changes thereafter. To 
serve static objects in response to a cUent's data retrieval request, the server typically reads the 
object from its persistent or volatile storage, formats it into one or more network packets 
according to TCP/IP, and sends it to the client over the network. Thus, in a conventionally 
implemented server, static objects incur substantial overhead in network processing. If a static 
object is repeatedly accessed by clients in data retrieval mode, as is common in Web and file 
servers, the same amount of network processing is repeatedly apphed even though the resulting 
object does not usually vary from one instance to the next. With the exception of the destination 
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Internet address and some other minor protocol-related information, a packet generated from a 
static object is essentially the same each time it is generated. Because static objects represent an 
appreciable percentage of installed web pages, it would be desirable to implement a method and 
system in which the network processing of static objects is substantially reduced. 

5 

SUMMARY OF THE INVENTION 

The problems identified above are in large part addressed by a method and system for 
storing static objects, including Hypertext Transfer Protocol (HTTP) documents, in the form of a 

10 one or more IP-compliant packet trains in memory or on disk. Upon subsequent access to a 
static object, the saved packet train may be retrieved from memory or from disk and the 
subsequent network processing would merely require some updating of header information. In 
this manner, a significant portion of the network protocol processing may be eliminated when 
static objects are referenced frequently. In one embodiment, the server would include code for 

15 determining whether a referenced object is a static object. The server may fizrther include a 
directory of recently accessed static objects and a copy of the packet trains corresponding to 
these recently accessed objects. The server is preferably configured to consult the directory to 
determine if an object is static and if it has been recently accessed. If the object has been 
recently accessed, the server may retrieve the corresponding packet(s) from its system memory, 

20 otherwise the server retrieves said packets from disk. Following retrieval of the packets, the 
server may update certain header fields of the packet to reflect the current request. With the 
updated header information the packet train is then suitable for transmission over the network as 
a response to the cUent's request. It should be appreciated by those skilled in the art having the 
benefit of the disclosure that if the packets exist in memory, then the processing required by this 

25 method does not entail memory allocation, packet construction, data copying, or heavyweight 
protocol processing such as computing checksums or manipulating sockets. 

BRIEF DESCRIPTION OF THE DRAWINGS 



30 



Other objects and advantages of the invention will become apparent upon reading the 
following detailed description and upon reference to the accompanying drawings in which: 
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FIG 1 is a block diagram of selected components of a data processing network according 
to one embodiment of the present invention; 

FIG 2 is a conceptual diagram of an object database according to one embodiment of the 
invention; and 

5 FIG 3 is a flow diagram of a method for responding to client requests on a server 

according to one embodiment of the invention. 

While the invention is susceptible to various modifications and alternative forms, specific 
embodiments thereof are shown by way of example in the drawings and will herein be described 
in detail. It should be understood, however, that the drawings and detailed description presented 
10 herein are not intended to limit the invention to the particular embodiment disclosed, but on the 
contrary, the intention is to cover all modifications, equivalents, and alternatives falling within 
the spirit and scope of the present invention as defined by the appended claims, 

5 DETAILED DESCRIPTION OF THE INVENTION 

Turning now to the drawings, FIG 1 depicts selected elements of a data processing 
,g network 100 according to one embodiment of the invention. In the depicted embodiment, 
1:, network 100 includes a client 102 connected to a server cluster 103 through a network 
H intercotmection 101. Client 102 typically includes a client application program such as a web 
m 20 browser or the like running on a client device. Suitable client devices include desktop and laptop 
personal computers, network (diskless) computers and workstations, and other network aware 
devices including Internet enabled phones and personal digital assistants (PDAs). 

Network interconnection 101 may represent a wide area network (WAN) such as the 
Internet that includes a plurality of network devices including appropriate switches, routers, hubs 
25 etc., as well as the connecting media whether it is copper cable, optical fiber cable, or a wireless 
medium. Typically, each device or network connected to network interconnection 101 complies 
with and includes driver routines for a standardized network communication protocol In a 
widely applicable embodiment in which network interconnection 101 represents the Internet, for 
example, each of the connected devices and networks is TCP/IP compliant. 
30 Server cluster 103 as depicted in FIG 1 is representative of a dense server configuration 

suitable for providing services to the Internet community. Server cluster 103 includes a switch 
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104 that interfaces a number of server devices 106 to the network interconnection 101. Each of 
the servers 106 is connected to switch 106 through a local area network (LAN) 105. LAN 105 is 
typically implemented with high speed Ethernet although the invention is suitable for use with a 
variety of LAN implementations. 
5 In the depicted embodiment, LAN 105 includes networked storage 108 that represents a 

non-volatile, mass storage facility available to each server 106 in cluster 103. Networked storage 
108 typically includes one or more Network Attached Storage (NAS) devices, a fibre channel 
Storage Area Network (SAN), or a combination thereof In one embodiment, servers 106 may 
be implemented without a non-volatile storage device (i.e., hard disk) to minimize cluster cost. 
10 In the depicted embodiment, chent 102 may issue client requests to server cluster 103 via 

network interconnection 101. The client request is received by server cluster 103 and routed to 
an appropriate server 106 by switch 104. The server 106 interprets the chent request and 
O generally issues a response back to chent 102. A common example of such a sequence occurs 
when the user of client 102 selects a web address with a web browser. The browser generates an 
[=45 HTTP formatted GET request identifying the selected web address. The TCP/IP drivers on 
;F chent 102 process the HTTP request to produce a network compliant packet or set of packets that 
M includes a network address (e.g., an IP address) of server cluster 103. The packet(s) are then 
L routed to server cluster 103 and directed to a server 106 by switch 104. 

'^4 Referring now to FIG 2, selected elements of server 106 according to one embodiment of 

S20 the invention are depicted. It is to be understood that selected features of network 100, client 
102, and server 106 may be implemented as a set of computer executable instructions (software). 
Such software may be stored in a computer readable medium including volatile mediums such as 
the system memory (DRAM) or cache memory (SRAM) of server 106 as well as non-volatile 
mediums such as a magnetic hard disk, floppy diskette, CD ROM, flash memory card, DVD, 
25 magnetic tape, and the like. 

As depicted in FIG 2, server 106 includes an interpreter 202, which processes client 
requests and generates the corresponding responses. In one embodiment, interpreter 202 
includes an HTTP interpreter capable of processing HTTP formatted client requests. 
Furthermore, in one embodiment interpreter 202 may be implemented as an application-level 
30 process for maintainabihty, or it may be implemented as a kernel-level process for high 
performance. 
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Interpreter 202 works in conjunction with a network interface 204 and network 
processing code 210 to receive requests and send responses through network interconnection 101 
(through, in turn, local area network 105 and switch 104), Network processing code 210 may 
include the operating system's TCP/IP routines that transform apphcation level messages 
5 (requests and responses) into network packets and vice versa. Network processing code 210 may 
include portions of the operating system kernel code. Typically, it is desirable to access or 
invoke kernel code as seldom as possible because of the extensive context state that must be 
saved when a processor switches from executing user code to protected code. The present 
invention contemplates a reduction in context switching for selected types of requested objects. 
10 Interpreter 202 according to one embodiment of the invention is configured to determine 

when a requested object is a static object. For purposes of this disclosure, a static object refers to 
an object that is time invariant. In other words, a request for a static object always generates the 

u same responsive content. Because static objects, including static HTTP documents, represent a 
significant portion of the content that is accessible via the Internet, a method of handling requests 

^35 for such documents that improved response time may result in an appreciable improvement in 

£ overall network performance. 

2^ Server 106 includes a static object directory 208. Interpreter 202 uses the information 

stored in static object directory 208 to determine if the requested object is static, and if so, 
whether pre-formatted packets corresponding to the requested object have been previously 
J^20 generated. If the packets have been previously generated, interpreter uses the static object 
p directory information to determine the location of the pre-formatted packets corresponding to the 
object reside (e.g., local memory, local disk, or on network storage 108). If the information in 
directory 208 indicates that the requested object is static but requires processing to generate the 
pre-formatted packets, then interpreter 202 retrieves the requested object from an object cache 
25 206 or, if the object is not available in cache 206, from disk storage. After the object has been 
retrieved, interpreter 202 invokes network processing code 210 to generate a response in the 
form of a network compliant packet or set of packets (packet train) suitable for transmission to 
cHent 102. 

Upon constructing a response to a request for a static object, interpreter 202 may store the 
30 constructed packets locally (or on disk) as a separate entity referred to herein as a form of a 
response to a request for the object. Interpreter 202 may then update static object directory 208 
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to reflect that a form response containing the corresponding object exists and to indicate its 
location. The form response or pre-formatted packet train may then be retrieved when the object 
is subsequently requested by another cUent. By retrieving the form response, server 102 can 
bypass or reduce time-consuming network protocol processing and improve overall response 
5 performance. 

Although the content of a static object does not vary from one request to the next, the 
network packets that carry the content may vary. More specifically, the network protocol 
processing performed by network interface 204 and processing code 210 typically includes 
appending various headers to the underlying content (the payload) where the headers contain 
10 information needed or desirable to send the information across the network to the correct 
destination. These headers may include, for example, the IP address of the requesting client and 
a time stamp. While the content of a static object is invariant, the header field values may vary 
y with each request for the object. 

uj To address this contingency, interpreter 202 may include code that updates a form 

1^15 response to produce correct header values for the current request. This header processing is 
significantly less time consuming than the fully general network processing that occurs when a 
response is generated fi:om scratch. For instance, fully general network processing of a static 
p^. HTTP object would include the computation of a checksum based on the content of the requested 
object, the allocation of (and eventual deallocation of) memory to store the packet, copying the 
m20 data from the object cache into the allocated memory, formatting the packets according to the 
J'f protocol requirement, and the setting up of appropriate scatter/gather buffers. Scatter/gather is 
used to do Direct Memory Access (DMA) transfers of data that is written to noncontiguous areas 
of memory. Scatter/gather buffers are a Ust of vectors, each of which gives the location and 
length of one segment of the requested object. 
25 In one embodiment, interpreter 202 inserts one or more blank application layer header 

fields into the form response. When interpreter 202 processes a request, it can update the 
application layer header fields to reflect the current request. The response header fields allow 
server 106 to pass information about the response that cannot be placed in the response status 
line. These header fields give information about server 106 and about further access to the 
30 requested object. HTTP supports a finite set of response header fields as outlined in the HTTP 
specification from the Internet Engineering Task Force (IETF). See, Hypertext Transfer 
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Protocol - HTTP 1.1, RFC 2616 (IETF 1999). The HTTP response header fields may include 
an Accept-Range header field, Age header field, ETag header field. Location header field, 
Proxy- Authenticate header field, Retry- After header field, Server header field, Vary header field, 
WWW- Authenticate header field, as well as additional general header fields. By inserting one or 
5 more blank header fields in the form response, server 106 and interpreter 202 are able to take 
advantage of the pre-processing of packets as described while retaining flexibility to insert 
response-specific application header fields. If a response requires more header fields than are 
provided by the form response, interpreter 202 may opt to construct the response in the 
conventional manner using fiiUy general network protocol processing. 
10 FIG 3 presents a flow diagram illustrating a method 300 of responding to a client request for a 
specific object in a data processing network. In the depicted embodiment, server 106 receives a 
client request (block 302). The server device determines (block 304) whether a form of a 
O response containing the requested object is available to the server. If no such representation of 
,0 the response is available, a response comprising a set of one or more packets is generated (block 
[^15 306) using fiiUy general network protocol processing including the allocation of memory for the 
;P generated response, appropriate data copying, packet formatting, and the calculation of an 
appropriate checksum. In addition, the interpreter will determine (block 308) whether the 
requested object is a static object. , If the requested object is determined to be a static object, the 
'^'v! a form response is generated (block 309) fi^om the packetized response. Generating the form 
m20 response may include inserting one or more blank header fields into the formatted packets. The 
form response is then stored (block 310) in the static object database as a set of pre-formatted 
packets available for subsequent requests. The static object directory is then updated (block 311) 
to include a reference to the form response thereby enabling subsequent requests to locate and 
access the form. 

25 If the interpreter determines in block 304 by searching the static object database that an 

appropriate form response has been previously generated, the form response is located using the 
static object database and retrieved (block 312). The interpreter may then update (block 314) 
appropriate headers in the form response where the effort required to update the response headers 
is significantly less than the fiilly general network protocol processing that would otherwise 

30 occur. Ultimately, the response, whether generated from a form response or from scratch, is 
transmitted (block 316) to the chent. By storing the responses generated for static objects for 
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subsequent re-use the present invention improves response performance for a potentially 
significant portion of the client requests, e.g. chent requests for static HTTP web pages. 

It will be apparent to those skilled in the art having the benefit of this disclosure that the 
present invention contemplates a method and system for storing the responses generated for 
static objects including static HTTP pages for subsequent re-use to reduce network processing on 
subsequent requests for these objects. It is understood that the form of the invention shown and 
described in the detailed description and the drawings are to be taken merely as presently 
preferred examples. It is intended that the following claims be interpreted broadly to embrace all 
the variations of the preferred embodiments disclosed. 



